Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff
نویسندگان
چکیده
منابع مشابه
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff
In this paper, we study the problem of manually correcting automatic annotations of natural language in as efficient a manner as possible. We introduce a method for automatically segmenting a corpus into chunks such that many uncertain labels are grouped into the same chunk, while human supervision can be omitted altogether for other segments. A tradeoff must be found for segment sizes. Choosin...
متن کاملExplicit Utility in Supervised Learning
We use a graphical model of the supervised learning problem to explore the theoretical effect of utility in the form of end use and sample cost on supervised learning, No-Free-Lunch, sample complexity, and active learning. There are two sources of utility that can affect the above problems: utility that comes from end use and utility that comes from sample costs. We explore which parts of these...
متن کاملOptimizing the Relevance-Redundancy Tradeoff for Efficient Semantic Segmentation
Semantic segmentation aims at jointly computing a segmentation and a semantic labeling of the image plane. The main ingredient is an efficient feature selection strategy. In this work we perform a systematic information-theoretic evaluation of existing features in order to address the question which and how many features are appropriate for an efficient semantic segmentation. To this end, we di...
متن کاملA Supervised Method for Constructing Sentiment Lexicon in Persian Language
Due to the increasing growth of digital content on the internet and social media, sentiment analysis problem is one of the emerging fields. This problem deals with information extraction and knowledge discovery from textual data using natural language processing has attracted the attention of many researchers. Construction of sentiment lexicon as a valuable language resource is a one of the imp...
متن کاملEfficient Interactive Annotation of Segmentation Datasets with Polygon-RNN++
Manually labeling datasets with object masks is extremely time consuming. In this work, we follow the idea of PolygonRNN [4] to produce polygonal annotations of objects interactively using humans-in-the-loop. We introduce several important improvements to the model: 1) we design a new CNN encoder architecture, 2) show how to effectively train the model with Reinforcement Learning, and 3) signif...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Transactions of the Association for Computational Linguistics
سال: 2014
ISSN: 2307-387X
DOI: 10.1162/tacl_a_00174